Four best practices for measuring news sentiment using ‘off-the-shelf’ dictionaries: a large-scale p-hacking experiment

نویسندگان

چکیده

We examined the validity of 37 sentiment scores based on dictionary-based methods using a large news corpus and demonstrated risk generating spectrum results with different levels statistical significance by presenting an analysis relationships between U.S. presidential approval. summarize our findings into four best practices: 1) use suitable dictionary; 2) do not assume that reliability dictionary is ‘built-in’; 3) check for influence content length 4) multiple dictionaries to test same hypothesis.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Large-scale news entity sentiment analysis

We work on detecting positive or negative sentiment towards named entities in very large volumes of news articles. The aim is to monitor changes over time, as well as to work towards media bias detection by comparing differences across news sources and countries. With view to applying the same method to dozens of languages, we use linguistically light-weight methods: searching for positive and ...

متن کامل

Expanding Chinese Sentiment Dictionaries from Large Scale Unlabeled Corpus

Unsupervised sentiment classification usually needs a user defined sentiment dictionary. However, the existing dictionaries in Chinese are insufficient, for example, the intersection rate of two popular Chinese sentiment dictionaries HowNet and NTUSD is less than 10%. In this paper, we present a method to help expand the dictionaries with more sentiment words by ranking them through link analys...

متن کامل

Large-Scale Sentiment Analysis for News and Blogs

Newspapers and blogs express opinion of news entities (people, places, things) while reporting on recent events. We present a system that assigns scores indicating positive or negative opinion to each distinct entity in the text corpus. Our system consists of a sentiment identification phase, which associates expressed opinions with each relevant entity, and a sentiment aggregation and scoring ...

متن کامل

Large-Scale Sentiment Analysis for News and Blogs (system demonstration)

News can be good or bad, but it is seldom neutral. Although full comprehension of natural language text remains well beyond the power of machines, the statistical analysis of relatively simple sentiment cues can provide a surprisingly meaningful sense of how the latest news impacts important entities. Here we demonstrate our large-scale sentiment analysis system for news and blog entities built...

متن کامل

Organizing Large Scale Hacking Competitions

Computer security competitions and challenges are a way to foster innovation and educate students in a highly-motivating setting. In recent years, a number of different security competitions and challenges were carried out, each with different characteristics, configurations, and goals. From 2003 to 2007, we carried out a number of live security exercises involving dozens of universities from a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computational communication research

سال: 2021

ISSN: ['2665-9085']

DOI: https://doi.org/10.5117/ccr2021.1.001.chan